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Abstract. The inspirals of stellar-mass compact objects into supermassive black 
holes constitute some of the most important sources for LISA. Detection of these 
sources using fully coherent matched filtering is computationally intractable, so 
alternative approaches are required. In a previous paper £Q, we outlined a detection 
method based on looking for excess power in a time-frequency spectrogram of the 
LISA data. The performance of the algorithm was assessed using a single 'typical' trial 
waveform and approximations to the noise statistics. In this paper we present results 
of Monte Carlo simulations of the search noise statistics and examine its performance 
in detecting a wider range of trial waveforms. We show that typical extreme mass 
ratio inspirals (EMRIs) can be detected at distances of up to 1-3 Gpc, depending on 
the source parameters. We also discuss some remaining issues with the technique and 
possible ways in which the algorithm can be improved. 
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1. Introduction 

Astronomical observations indicate that many galaxies host a supermassive black hole 
(SMBH) in their centre. The inspirals of stellar-mass compact objects into such SMBHs 
with mass M ~ few x 1O 5 M -1O 7 M constitute one of the most important gravitational 
wave (GW) sources for the planned space-based GW observatory LISA. The large 
parameter space of possible signals and their long duration makes detection using fully 
coherent matched filtering computationally intractable, so alternative search techniques 
are required. One option is to use a semi-coherent matched filtering algorithm and a 
preliminary analysis of that approach [2, suggests that the LISA EMRI detection rate 
will be dominated by inspirals of ~ 10 M BHs onto ~ 1O 6 M SMBHs, and could be 
as high as ~ 1000 in 3-5 years within ~ 3.5 Gpc. 

Matched filtering algorithms are computationally very expensive and it is therefore 
valuable to scope out alternatives. In a previous paper jT], we described a strategy to 
detect GWs from EMRIs by accumulating the signal power in the time-frequency (t-f) 
domain. The t-f power spectrum is produced by dividing the data into segments of equal 
duration and carrying out a Fast Fourier Transform (FFT) on each. The length of the 
data segments affects the resolution of the spectrogram and must be chosen to allow the 
evolution of the source to be followed in the t-f plane. This is a search parameter that 
should eventually be optimized, probably in a source specific way. For now we choose 
~ 2 week segments, as these provide appropriate resolution for typical EMRIs. The 
choice of segment duration does not significantly affect the computational requirements 
of the search, in contrast to the semi-coherent approach 2J which is limited to 2 — 3 
week segments by computational cost. 

The power spectrum is defined for each segment i and frequency bin k as, 



2(hr) 2 + 4 Re[/£(nf)*] + 2{nff 
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where h k denotes the Fourier amplitude of the signal, a 2 nk = 0.5Sh(f)/{dt 2 df) is the 
expected variance of the noise component at frequency bin k, characterized by Sh(f), 
the strain spectral density of the noise, dt is the sampling cadence and df is the Fourier 
bin width. We assume that the LISA noise components, rik, can be approximated by 
a stationary and Gaussian distribution, with spectral density Sh(f)- In addition to the 
instrumental noise, the LISA data stream will contain an unresolvable astrophysical 
foreground from galactic white dwarf (WD) binaries. We include the white dwarfs as a 
noise contribution in Sh in the usual way equations (52) and (54)). The instrumental 
noise spectral density is also taken from jHJ, equation (48). Including the WD foreground 
in this way assumes it is isotropic on the sky, which is not a good assumption in this 
case. For the same reason, the WD foreground will not be a stationary noise source on 
long timescales, but this is probably a reasonable approximation for few week long data 
segments. In the future, our analysis will be repeated using a more accurate realization 
of the WD distribution (e.g., |lj). Generally speaking, EMRIs are not detectable when 
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their frequencies overlap with the WD confusion background, so the results in this paper 
should not change significantly. However, the location in frequency of the confusion limit 
will be important, i.e., which WD binaries are resolvable and which are not [Sj. 

In the low-frequency regime, LISA can be regarded as consisting of two independent 
Michelson detectors which are denoted I and II [Hj. We combine the responses from 
these two detectors to produce the power spectrum. In this first analysis, we add 
the spectrograms from the I and II responses before performing the search. An 
alternative algorithm might involve searching each spectrogram separately and looking 
for coincidences. Such refinements will be investigated in the future. For the combined 
spectrogram, we calculate the power "density", p(i, k), by computing the average power 
within a rectangular box centered at each point (i, k), 



where n, / are the lengths of the box edges in the time and frequency dimensions 
respectively and m = n x / is the number of data points enclosed. To search for a 
possible signal, we vary the size of the box, and for each choice of n and I, (rij, lj) say, 
we search for any points at which p(i,k) exceeds a threshold, pj. A detection occurs 
when there is at least one point with p(i, k) > pj for at least one box size. In the first 
algorithm, we use only box sizes (nj,lj) = (2 nt ,2 n f), for all possible integer choices of 
n t and nj. The thresholds, pj, are set to ensure a small overall false alarm probability 
(FAP) for the search. This will be discussed in more detail later. 

The optimal box size for a given source will be large enough to include much of 
the signal power but small enough to avoid too much noise contribution. The optimum 
will be source specific due to the wide variation in EMRI waveforms. The inspiral of a 
0.6M Q white dwarf occurs much more slowly than that of a 10M Q black hole, so in the 
first case, the optimal box size is likely to be longer in the time dimension. GWs from 
an inspiral into a rapidly spinning black hole or from a highly eccentric inspiral orbit 
are characterized by many frequency harmonics, often close together. In that case, a 
box that is wider in the frequency dimension may perform well. In designing a search, 
a balance must therefore be struck between having sensitivity to a range of sources and 
increasing the reach of the search for a specific source. 

In section E] we discuss the statistics of this search in the absence of a signal. In 
section|3]we discuss how to assign thresholds for the different box sizes and in sectionHJwe 
illustrate the performance of the algorithm on a set of trial waveforms. We discuss some 
remaining issues and possible ways to develop time-frequency techniques in section EJ 

2. Statistics of the search in the absence of a signal 

2.1. Distribution of maximum power for one box size 

If no signal is present, then the power P(i, k) at each point in the unbinned plane is 
distributed as a \ 2 with 4 degrees of freedom. There are four degrees of freedom since 




(2) 



a=-n/2 b=-l/2 
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there are two independent Michelson responses and the real and imaginary parts of n\ 
are independent for each. Over many noise realizations, the total binned power, mp(i, k), 
at a specific point in the spectrogram will be distributed like a x 2 with 4 m degrees of 
freedom. The probability that mp(i, k) exceeds a threshold pj is therefore 1 — Q x 2 (pj) 
(where Q x z is the cumulative distribution function for a x 2 with X degrees of freedom). 
If q independent samples are drawn from such a distribution, the probability that the 
maximum, p m , is less than a specified value, p , is P(p m < po) = [Q x \ (Po)] 9 - However, 
in the binned t-f plane, mp(i, k) is not independent at nearby points since neighbouring 
bins overlap. There are at least N/m independent samples in the plane, but less than 
iV (where N is the total number of pixels in the unbinned spectrogram). In pQ, we used 
the estimate that there are N/(m/A) independent points. In practice, the distribution 
of the maximum is not analytically tractable, but it can be computed numerically using 
Monte Carlo simulations. 

From a Monte Carlo simulation of a quarter of a million noise realizations, we have 
computed the distribution of the maximum power in the absence of a signal for each box 
size used. These distributions are much more accurate than the approximations used 
in pp. The distributions were computed assuming we had 3 x 2 25 s (~ 3 years) of LISA 
data, sampled at a cadence of 8s and divided this into 2 20 s (~ 2 week) sections. This 
gives a time-frequency spectrogram with 96 points in time and 65536 points in frequency 
and hence 7 x 17 = 119 possible box sizes of the form n = 2 nt , I = 2 n f. For each noise 
realization, we computed the time-frequency spectrogram of the LISA mission and then 
calculated the maximum power in each binned spectrogram. Figure Q illustrates one 
typical distribution, with n = 8 and I = 2048. It also shows the approximate theoretical 
distribution described above, under the assumption that there are q = N, N/m or 
N/(m/4) independent samples in the t-f plane. As expected, the true distribution 
lies between the theoretical distributions for N and N/m independent points. Our first 
approximation of N/ (m/4) independent points is not particularly accurate. Assuming 
iV/(m/64) independent points gives a better approximation to the distribution in this 
case, although the shape is clearly different. This number is surprisingly large, indicating 
that boxes with significant overlap are still effectively independent. This suggests that 
the maximum is usually determined by a few unusually bright pixels, rather than sizable 
regions of high power. The effective number of independent boxes is box size specific, 
but always lies between N/m and N. 

2.2. Overall search false alarm probability 

Assigning a threshold, pj, for a given box size amounts to fixing the false alarm 
probability for that box size, FAP., (i.e., the probability that this threshold is exceeded 
by pure noise). If the binned distributions were independent, the overall false alarm 
probability for the search, FAP, would be FAP = 1 - ILj(l — FAP,,). But as this is 
not the case, we must make use of the Monte Carlo simulation results again. From 
the numerical distributions we can compute a threshold for each box size that sets the 
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Figure 1. Distribution of maximum power for box size n = 8, I = 2048, as computed 
from Monte Carlo simulations. Also shown is the theoretical distribution described 
in the text, under the assumption that there are N, iV/(m/64), N/(m/4) or N/m 
independent points in the binned plane. 



FAPj to a specified value. We then use the Monte Carlo data and count the fraction of 
pure noise realizations in which the maximum power exceeds the threshold for at least 
one box size. This is the overall search FAP. Since the t-f plane encompasses the whole 
of the LISA data stream, the FAP is the fraction of LISA missions in which we would 
expect to have a false alarm. In Figure El the search FAP is shown as a function of the 
(equal) FAP., assigned to each box size. For small FAPj, FAP~ lOOFAPj, indicating 
that effectively 100 of the 119 box sizes used are independent. 

3. Choice of thresholds 

At a fixed search FAP, a threshold choice corresponds to dividing up the FAP among 
the different box sizes. If we have no prior knowledge of the signal, the natural way 
to do this is to treat all the box sizes equally, i.e., set FAPj = constant. This can 
be done accurately by determining the thresholds from the numerical distributions, 
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Figure 2. Overall FAP of the search as a function of the FAP assigned to each box 
size. This assumes all box sizes are assigned equal FAPj. 

as in the previous section. In pQ we set thresholds to give uniform FAP^'s under 
the theoretical approximation that pP m was distributed as the maximum of N/(rrij/4) 
independent samples from a x\ mj distribution. Using that approach, the true FAPj's 
are not equal, but we determined the true FAP of the search for each threshold choice 
using a small Monte Carlo simulation. 

If we are searching for a particular source, the threshold choice can be tuned. To 
illustrate this, we use the 'typical' EMRI described in [T] (source "A"). This is the last 
three years of the inspiral of a 10 M Q black hole into a 10 6 M Q SMBH of spin a = 0.8M, 
with orbital eccentricity eo = 0.4 at the start of the observation and inclination l q = 45°. 
The frequency spread, time spread and SNR of our target waveform in principle allow 
us to guess theoretically which box size will be optimal for detecting it. However, the 
presence of noise means that the source will not always be detected in the same box 
size and indeed it will not always be the same part of the inspiral that is detected 
(different box sizes are optimal for different stages of the inspiral). To illustrate this, we 
ran a Monte Carlo simulation of the search over several hundred noise realizations with 
source A present in the data, and recorded every time a threshold was exceeded, and 
for which box size this occurred. The thresholds were set to give equal FAPj to all box 
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Figure 3. Fraction of detections in each box size when the signal A is present 
and absent. The box sizes have been numbered sequentially - boxes numbered 1- 
17 have n% = 1, rif = 1,2,4..., boxes numbered 18-34 have n t = 2, rif = 1,2,4,... etc. 
Thresholds were assigned to give equal FAPj to each box size. 



sizes. When this procedure was repeated without the signal, the number of detections 
was roughly independent of the box size, as we would expect. In Figure El we show 
the fraction of 'detections' (i.e., a threshold being exceeded) as a function of the box 
size label. It is clear that there are preferred box sizes for detecting this source, but 
more than one. For a given noise realization, the source might be detected (i.e., exceed 
the box threshold pj) with more than one box size, but these box sizes do vary from 
realization to realization. Thus, a targeted search for source A should still use a number 
of box sizes. 

In Figure H] we compare the performance of the algorithm at detecting source 
A under these various choices for the thresholds. We show the Receiver Operator 
Characteristic (ROC) curve, which plots the detection rate versus the overall search 
FAP. The detection rate was estimated by adding the signal waveform to a sequence of 
noise realizations, and running the search algorithm. We used ~ 6500 noise realizations, 
giving an accuracy in the detection rate of ~ 0.02%. 
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The theoretical threshold assignment used in PQ gives performance that is 
comparable with the true 'equal FAPj' threshold assignment. A targeted search, which 
uses only the eleven box sizes which have detection rates greater than 0.03 in Figure El 
and assigns equal FAP 3 - to each, performs significantly better than the blind search. 
However, if we use the set of box sizes tuned for source A to detect other sources, the 
performance can be worse. For example, when searching for the source "S" (described 
below) at a distance of 1.2 Gpc and at a search FAP of 1%, the detection rate using 
thresholds tuned for source A is 25% compared to 50% in the untargeted search. The 
search can also be tuned for source S, and a final data analysis scheme might involve 
using a targeted search for one type of inspiral followed by a targeted search for another 
type and so on. However, using multiple tuned searches increases the overall false alarm 
probability and eventually will do no better than a blind search. Without a much better 
knowledge of the likely parameters of EMRI sources in the LISA data, it is probably 
best to treat all box sizes equally. In the next section, we will use thresholds that give 
equal FAPj to all box sizes. 

4. Algorithm performance 

To assess the algorithm's performance, we computed ROC curves for a sequence of 
inspirals at a number of distances. The inspiral waveforms were computed using the 
'numerical kludge' approximation [HEIE], with detector modulations added using the 
low frequency approximation to the LISA response |B|. The detector response can be 
treated more accurately [101 HH H2] but the low frequency response is sufficient for this 
analysis. Our 'typical' source, "A" , has the parameters specified above and was placed at 
a sky position cos(9s) = 0.5, <ps = 1, with orientation chosen as cos(#k) = —0.5, 4>k = 4. 
The angles 9s and 0s are the ecliptic colatitude and azimuth of the source on the sky, 
while the angles Qk and <pK are the ecliptic colatitude and azimuth to which the spin 
of the SMBH would point in the Solar System frame. The other trial waveforms were 
obtained by changing one or two of the inspiral parameters. The waveform parameters 
for the inspirals considered are listed in Table For each inspiral, the initial semi-latus 
rectum, p, of the orbit was chosen such that the plunge would occur a few days after 
the end of the LISA observation, i.e., in each case we are focusing on approximately the 
last three years of inspiral. Tabled also lists the approximate signal to noise ratio that 
would be achieved in a fully coherent matched filtering search, using both Michelson 
data streams (I and II), if the source was at a distance of 1 Gpc. 

For each trial waveform we performed a Monte Carlo simulation, with the source at 
distances of 0.8, 1.2, 1.4, 2 and 3 Gpc. Source "A" was additionally placed at distances 
of 1, 1.75, 2.25 and 2.5 Gpc. In Figure El we show the ROC for detection of source "A" 
at each distance, in Figure El we show the ROCs for detection of all the sources at a 
distance of 2 Gpc and in Table El we summarize the detection rates for each source at 
each distance, if the thresholds are set to give an overall search false alarm probability 
of ~ 1%. Since this is the false alarm rate per LISA mission, we could afford to increase 
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Figure 4. ROC of the search for detection of source A at 2 Gpc under different 
assignments of the thresholds. "Equal FAP, all bins" is the threshold choice that 
assigns equal false alarm probability to all box sizes used. "Theoretical" is the threshold 
choice used in pQ, which assigns equal false alarm probability to all box sizes, assuming 
the maximum power is distributed as the maximum of 7V/(m/4) independent points 
drawn from a Xim distribution. "Equal FAP, targeted" uses a reduced number of box 
sizes, tuned for this source, and assigns equal FAPj to each. "Random" indicates 
the performance of a random search, for which the detection rate and false alarm 
probability are equal. 

this number in the final search, up to 10% or more. However, for the present we use 
1% in order to compare the performance of this technique directly to that of the semi- 
coherent matched filtering search j2]. In the latter search, a matched filtering SNR of 
~ 35 is required for detection when the overall search FAP is set at 1%. No ROC curves 
for the semi-coherent search are presently available, but the detection rate is estimated 
to be ~ 50% at threshold. 

With an overall search false alarm probability of 1%, we see from Figure 03 that 
source A is easily detectable out to 2 Gpc. The detection rate at greater distances 
decreases quite rapidly, although it is still ~ 20% at 2.25 Gpc. The absolute limit of 
the search for source A is approximately 3 Gpc, at which point we do no better than 
a random search. This compares quite favourably with the performance of the semi- 
coherent matched filtering search [2|, which in principle can detect this source out to 
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~ 4.5 Gpc in a Euclidean Universe. In fact, in [2], the reach of the search was limited at a 
redshift z ~ 1, or ~ 3.5 Gpc, by astrophysical considerations, since there is considerable 
uncertainty about the formation of EMRI sources at higher redshift. 

Varying the parameters of the inspiral has a noticeable effect on the detectability. 
The white dwarf inspirals (sources D, E and F) are essentially not detectable (they 
would be detectable very close by, but we do not expect any such sources closer than 
~ 0.5 Gpc). However, this is because these sources are weak and the SNR is so low that 
they would not even be detected by the semi-coherent technique. The intermediate mass 
black hole inspiral (source G) can be detected to great distances since it is very loud, but 
it is presently unclear whether any such sources are likely to exist fSl Ej • The inspirals 
into a lower (source B) or higher (source C) mass SMBH are less detectable, the former 
becoming marginally detectable at 2 Gpc and the latter becoming marginally detectable 
at about 2.25 Gpc. This is a consequence of the shape of the LISA noise curve. Using 
the semi-coherent technique these sources are similarly less detectable than source A, 
although they can still be detected out to ~ 3.5 Gpc. 

Varying the eccentricity (sources K-N) has a moderate effect on detectability - 
lower eccentricity sources can be detected somewhat further away. This is in contrast 
to the semi-coherent technique, for which eccentricity has little effect on detectability. 
Low eccentricity inspirals have power spread over fewer frequency components, and so 
the power is more likely to be concentrated in certain regions in the t-f plane, making 
it easier for this algorithm to detect them. Increasing the central SMBH spin (sources 
H-J) appears to increase the detectability. Inspirals into highly spinning black holes 
will be at higher frequencies (and thus over the floor of the instrumental noise) and 
will plunge closer to the central SMBH. Both of these properties increase the SNR 
of the source and therefore are likely to increase the detectability by either this t-f 
method or the semi-coherent method, although this has not been investigated for the 
latter technique. Increasing the inclination of the orbit with respect to the equatorial 
plane of the SMBH (sources O-T) also affects detectability, with lower inclination orbits 
being more detectable. The inclination is defined [Zj such that an inclination of 0° is a 
prograde equatorial orbit, while an inclination of 180° is a retrograde equatorial orbit. 
The increased detectability at low inclination is again largely due to the fact that such 
orbits plunge closer to the SMBH and thus have higher SNR. 

Finally, changing the extrinsic parameters (source position on the sky and 
inclination of the SMBH spin with respect to our line of sight) also affects detectability 
(sources ExtrinsA-ExtrinsF). By ensuring that the source is in a sky position to which 
LISA is very sensitive, and that the black hole spin is oriented favourably with respect 
to the detector, the detectability can be increased. Similarly, placing the source 
unfavourably can make detection marginal at 2 Gpc. However, the extrinsic parameters 
chosen for source A seem to represent an 'average' response performance. 
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Label 


Parameters 


Initial p/M 


SNR 


A 


See text 


10.3 


155 


B 


M = 3 x 10 5 M Q 


18.25 


119 


C 


M = 3 x 10 6 M Q 


6.5 


110 


D 


m = O.6M , M = 3 x 1O 5 M 


9.405 


14.1 


E 


m = O.6M 


5.83 


21.0 


F 


m = O.6M , M = 3 x 1O 6 M 


4.511 


15.7 


G 


m = 1OOM 


17.78 


382 


H 


a = 0.95M 


10.07 


170 


I 


a = 0.5M 


10.74 


132 


J 


a = 0.1M 


11.31 


108 


K 


e n = 


10.42 


147 


L 


e n = 1 


10.41 


150 


M 


e = 0.25 


10.385 


151 


N 


e n = 7 


9.71 


159 





i = 


9.925 


223 


P 


i = 30° 


10.1 


189 





i = 60° 


10.59 


115 


R 


i = 120° 


12.126 


57.7 


S 


t = 150° 


12.82 


79.6 


T 


i = 180° 


13.11 


87.0 


Ext r ins A 


cos(6>s) = 0.99 


10.3 


117 


ExtrinsB 


cos(^ s ) = 0.01 


10.3 


162 


ExtrinsC 


cos(^k) = 0.99 


10.3 


118 


ExtrinsD 


cos(e K ) = 0.01 


10.3 


146 


ExtrinsE 


<f) K = 0.01 


10.3 


111 


ExtrinsF 


<f) K = 2. 


10.3 


111 



Table 1. Parameters and signal to noise ratios at 1 Gpc for trial waveforms. 
Unspecified parameters are the same as source "A" , as given in the text. 

5. Conclusion 

We have presented a thorough analysis of the time-frequency method to detect EMRIs 
with LISA that was originally proposed in |T|. We have accurately computed the 
statistics of this search and examined its performance on a wide range of possible EMRI 
signals. We find that this algorithm is able to detect many different EMRI events out 
to distances of 1 — 3 Gpc, depending on the source parameters. In an untargeted search, 
a typical source can be detected at 2 Gpc with a detection rate of 60% at a search 
false alarm probability of 1%. Lower eccentricity sources, which have less frequency 
spreading, can be detected as far away as 3 Gpc with detection rates of 50% at the same 
overall FAP. The reach of the search can be extended by increasing the allowed FAP or by 
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Figure 5. ROC performance of the search for detecting the typical EMRI, source 
"A", at distances of 0.8, 1, 1.2, 1.4, 1.75, 2, 2.25, 2.5 and 3 Gpc (from uppermost 
downward). The lowermost solid line indicates the performance of a random search, 
as in Figure 0| Note that the curves for 0.8 — 1.4 Gpc lie on top of the upper axis, 
with 100% detection rate at all the false alarm rates considered. 

using a targeted search. By comparison, the semi-coherent matched filtering algorithm 
|2j can reach ~ 4.5 Gpc for an overall FAP of 1%, but at a presently undetermined 
detection rate (perhaps ~ 50%). Broadly speaking, this time frequency search has 
better than half the reach of the semi-coherent search j2] , but at a tiny fraction of the 
computational cost. This is very encouraging, given the simplicity of the technique, 
and suggests that a method like this could be a valuable first step for detecting the 
loudest EMRI events in the LISA data. The approach is very similar to the 'excess 
power' technique used in LIGO jTSj, which was designed to search for bursting signals. 
Although we have concentrated on EMRIs, this technique will also be sensitive to other 
types of source, including bursts, galactic white dwarf binaries etc. and so it may have 
many applications. 

However, one significant issue that we have not yet considered is that of confusion. 
The LISA data will be dominated by a superposition of signals from many different 
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Figure 6. ROC performance of the search for detecting the trial EMRI events at 
a distance of 2 Gpc. Each plot shows the typical inspiral source "A", and the ROC 
performance of a random search. The other trial sources have been divided among the 
four figures and have the parameters specified in Tabled 



astrophysical sources. So far, we have only examined the problem of detecting a single 
source in instrumental noise (plus an astrophysical foreground of white dwarf binaries). 
This algorithm should be able to make detections when there are multiple sources if 
the sources are reasonably well separated in time and frequency or of widely different 
signal to noise ratios, but it is not clear how it will fair on overlapping sources of 
comparable brightness. It is likely that the performance will be significantly impaired 
in such cases, due to the simplicity of the algorithm. Additionally, a shortcoming of 
this simple technique is that while it can tell us that a source is present, it does 
not provide much information about the properties or parameters of that source. In 
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0.8 Gpc 


1.2 Gpc 


1.4 Gpc 


2 Gpc 


3 Gpc 


A 


1 


1 


1 


0.60 


0.02 


B 


1 


1 


0.93 


0.04 


0.01 


C 


1 


1 


1 


0.10 


0.02 


D 


0.00 


0.00 


0.02 


0.01 


0.01 


E 


0.01 


0.00 


0.00 


0.02 


0.00 


F 


0.03 


0.02 


0.02 


0.01 


0.02 


G 


1 


1 


1 


1 


1 


H 


1 


1 


1 


0.96 


0.01 


I 


1 


1 


1 


0.17 


0.00 


J 


1 


1 


0.85 


0.02 


0.01 


K 


1 


1 


1 


1 


0.51 


L 


1 


1 


1 


1 


0.29 


M 


1 


1 


1 


1 


0.07 


N 


1 


1 


0.99 


0.22 


0.00 





1 


1 


1 


1 


0.63 


P 


1 


1 


1 


1 


0.10 





1 


1 


0.85 


0.03 


0.00 


R 


0.8 


0.02 


0.04 


0.00 


0.01 


S 


1 


0.53 


0.1 


0.02 


0.02 


T 


1 


0.96 


0.36 


0.01 


0.01 


ExtrinsA 


1 


1 


0.65 


0.02 


0.01 


ExtrinsB 


1 


1 


1 


0.94 


0.03 


ExtrinsC 


1 


1 


0.82 


0.03 


0.02 


ExtrinsD 


1 


1 


1 


0.31 


0.02 


ExtrinsE 


1 


0.99 


0.57 


0.03 


0.02 


ExtrinsF 


1 


0.99 


0.62 


0.02 


0.02 



Table 2. Detection rates for trial waveforms at various distances. Thresholds were set 
using the numerical probability distributions, and with an overall search false alarm 
probability of 1%. 

principle, a time-frequency analysis can provide at least the frequency and rate of 
frequency drift of an inspiral source, but this requires more sophisticated algorithms. 
One possibility would be to use boxes of different (i.e., non-rectangular) shapes, or to 
use a pattern recognition algorithm in the (binned or unbinned) t-f plane to search 
for tracks corresponding to inspirals. These more sophisticated techniques should also 
cope more readily with confusion. Finally, it will be instructive to test this and future 
algorithms using more accurate inspiral waveforms. The simple kludged waveforms used 
here are purely quadrupolar, and so their frequency structure is not as complex as true 
EMRI waveforms which might influence our results. 

Despite these remaining issues, the performance of this algorithm is very promising 
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and suggests that a time-frequency method of some kind is likely to be a useful first step 
in detecting EMRIs and other types of source in the LISA data. This provides a good 
incentive for developing and testing t-f algorithms in the ways suggested above and this 
work is now underway. 

Acknowledgments 

We thank Curt Cutler, Teviet Creighton, Leor Barack and Kip Thorne for critical 
discussions of this work. This work was supported in part by the Max Planck Institut 
fuer Gravitationshphysik (Albert Einstein Institut) (LW), by NASA grants NAG5-12834 
and NAG5-10707 (JG) and by St. Catharine's College, Cambridge (JG). 

References 

[1] Wen L and Gair J R 2005 Class. Quantum Grav. 22 S445 

[2] Gair J R, Barack L, Creighton T, Cutler C, Larson S L, Phinney E S and Vallisneri M 2004 Class. 

Quantum Grav. 21 S1595 
[3] Barack L and Cutler C 2004 Phys. Rev. D 69 082005 

[4] Nelemans G, Yungelson L R and Portegies Zwart S F 2001 Astronomy & Astrophysics 375 890 
[5] Timpano S E, Rubbo L J and Cornish N J 2005 preprint gr-qc/0504071 
[6] Cutler C 1998 Phys. Rev. D 57 7089 

[7] Glampedakis K, Hughes S A and Kennefick D 2002 Phys. Rev. D 66 064005 
[8] Creighton T, Gair J R, Hughes S A and Vallisneri M 2005, in preparation 
[9] Gair J R and Glampedakis K 2005, in preparation 

[10] Cornish N J and Rubbo L J 2003 Phys. Rev. D 67 022001, Erratum-ibid. D 67 029905 

[11] Rubbo L J, Cornish N J and Poujade O 2004 Phys. Rev. D 69 082003 

[12] Vallisneri M 2005 Phys. Rev. D 71 022001 

[13] Madau P and Rees M J 2001 Astrophys. J. Lett. 551 L27 

[14] Miller M C 2002 Astrophys. J. 581 438 

[15] Anderson W G, Brady P R, Creighton J D E and Flanagan E E 2001 Phys. Rev. D 63 042003 



